Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mall No.: EV 201890706 US REPLACEMENT SHEET Sheet 1 of 17 



Figure 1 



10 



PROCESSING 
ROUTINES (e.g. routines 
for multiple alignment, 
PCP motif generation, 
mining, etc.) 



16 



Data 

(e.g. sequence data, 
structural data, 
etc.) 



20 



12 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mail No.: EV 201890706 US REPLACEMENT SHEET Sheet 2 of 17 



Figure 2 




PROVIDE MULTIPLE SEQUENCE 
ALIGNMENT (e.g., alignment for 
family of related sequences) 



34 



38 



PROVIDE SEQUENCE 
DATABASE (e.g., one or 
more lists of sequences) 



GENERATE PHYSICAL 
CHEMICAL PROPERTY (PCP) 
MOTIFS (i.e., PCP signatures) 



30 



MINE DATA USING PCP MOTIFS TO 
LOCATE RELATED SEQUENCE 
SEGMENTS (e.g., use window scoring function 
to detect related sequence segments) 



-36 



V 



DETERMINE WHICH PROTEINS OF THE 
SEQUENCE DATABASE MOST 
RESEMBLE THE PHYSICAL-CHEMICAL 
PROPERTIES OF THE ORIGINAL FAMILY 
OF SEQUENCES USED TO GENERATE 
THE MULTIPLE SEQUENCE ALIGNMENT 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial N(M 10/817,530 FUed: April 2, 2004 Docket: 265.00400101 

Express Mall No.: EV 201890706 US REPLACEMENT SHEET Sheet 3 of 17 

J 



Figure 3 



COLLECT DIVERSE RELATED 
SEQUENCES (e.g., known protein 
family desired to be used for query 
of sequence database) 




GENERATE MULTIPLE 
SEQUENCE ALIGNMENT 



PROVIDE THE MULTIPLE SEQUENCE 
ALIGNMENT (e.g., an alignment including 
a plurality of columns, each vertical column 

corresponding to a position horizontally in 
the multiple sequence alignment) 




32 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial N<£ 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mall No.: EV 201890706 US REPLACEMENT SHEET Sheet 4 of 17 



Figure 4 



PROVIDE THE 
MULTIPLE SEQUENCE 
ALIGNMENT 



60 



,64 



PROVIDE MULTIDIMENSIONAL NUMERICAL 
PCP DESCRIPTORS FOR EACH AMINO ACID 

BASED ON THE PHYSICAL CHEMICAL 
PROPERTIES OF EACH AMINO ACID (e.g., the 
numerical PCP descriptors for each amino acid 
including a numerical descriptor for each of a plurality 
of Eigenvectors) 



DESCRIBE THE MULTIPLE SEQUENCE 
ALIGNMENT (e.g., each amino acid thereof) 
QUANTITATIVELY IN TERMS OF THE PCP 
DESCRIPTORS AS A SERIES OF (n) 
EIGENVECTORS 



62 



VERTICALLY ANALYZE THE QUANTITATIVELY 
DESCRIBED MULTIPLE SEQUENCE ALIGNMENT, 

ON A COLUMN BY COLUMN BASIS, AS A 
FUNCTION OF THE CONSERVATION OF THE PCP 
DESCRIPTORS IN EACH COLUMN IN THE SERIES 

OF EIGENVECTORS, DETERMINE THE 
CONSERVATION FN EACH COLUMN BASED ON 
STANDARD DEVIATION AND RELATIVE ENTROPY 



66 



DEFINING PCP MOTIFS AS CONSECUTIVE 
POSITIONS IN THE MULTIPLE SEQUENCE 
ALIGNMENT WHERE SIGNIFICANT 
CONSERVATION EXISTS (e.g., where the relative 
entropy exceeds a predetermined limit) 




68 



▼ 



GENERATE A SERIES OF PCP MOTIF 
PROFILE MATRICES FOR EACH PCP MOTIF 




70 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants) : Braun etal. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mail No.: EV 201890706 US REPLACEMENT SHEET Sheet 5 of 17 



Figure 5 



PROVIDE THE MULTIPLE 
SEQUENCE ALIGNMENT 



PROVIDE MULTIDIMENSIONAL NUMERICAL PCP 
DESCRIPTORS FOR EACH AMINO ACID BASED ON 
THE PHYSICAL CHEMICAL PROPERTIES OF EACH 
AMINO ACID (e.g., the numerical PCP descriptors for 
each amino acid including a numerical descriptor for each 
of a plurality of Eigenvectors) 



84 



DESCRIBE THE MULTIPLE SEQUENCE ALIGNMENT QUANTITATIVELY 
IN TERMS OF THE PCP DESCRIPTORS RESULTING IN (N) DISTINCT PCP 

DESCRIBED ALIGNMENTS CORRESPONDING TO THE N 
EIGENVECTORS, EACH DISTINCT ALIGNMENT INCLUDING COLUMNS 
CORRESPONDING TO THE COLUMNS (i.e. positions) OF THE MULTIPLE 

SEQUENCE ALIGNMENT, WHEREIN EACH COLUMN INCLUDES 
NUMERICAL PCP DESCRIPTORS (i.e. corresponding to the eigenvector being 
defined) FOR EACH AMINO ACID (i.e., residue) IN THE COLUMN 



62 



ANALYZE EACH OF THE N PCP DESCRIBED ALIGNMENTS, 

COLUMN BY COLUMN, TO GENERATE CONSERVATION 
PROPERTY DATA FOR EACH COLUMN THEREOF (e.g., the 

conservation property data including the average value 
of each column and the related standard deviation for 
the column, and a relative entropy for the column) 



SCANNING (e.g., horizontally across sequential columns that were 
analyzed) THE CONSERVATION PROPERTY DATA (e.g., the 
relative entropy values generated for each column of the PCP 
described alignment being analyzed) TO DETECT CONSECUTIVE 
POSITIONS (e.g., corresponding to consecutively analyzed 
columns) WHERE THE RELATIVE ENTROPY EXCEEDS A 
PREDETERMINED LIMIT 



BASED ON THE DETECTED CONSECUTIVE POSITIONS 
AND USER SPECIFIED GAP AND MINIMUM LENGTH 
LIMITS, DETERMINING WHERE THE PCP MOTIFS ARE 
LOCATED IN THE MULTIPLE SEQUENCE ALIGNMENT 



92 



USER SPECIFIED GAP 
AND MINIMUM 
LENGTH LIMITS 



-90 



GENERATE A SERIES OF PCP MOTIF PROFILE MATRICES OR A MULTI- 
DIMENSIONAL MATRIX FOR EACH MOTIF (e.g., each motif defined by a plurality of 
matrices, with each matrix thereof corresponding to one of n eigenvectors, and further wherein 
each matrix includes a first level that includes average values for each position in the PCP motif 
for the corresponding eigenvector and related standard deviation, and a second level that gives 
the relative entropy of each position of the PCP motif for the corresponding eigenvector 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mail No.: EV 201890706 US REPLACEMENT SHEET Sheet 6 of 17 



Figure 6 



100 



.102 



PROVIDE THEPCP 
MOTIF PROFILE 
MATRICES USED 
TO DEFINE EACH 
PCP MOTIF 



PROVIDE SEQUENCE 
DATABASE TO BE 
SEARCHED (e.g., a plurality 
of proteins) 



104 



CONVERT SEQUENCE DATABASE INTO 
SEARCHABLE FORM CORRESPONDING TO PCP 
MOTIF PROFILE MATRICES USING NUMERICAL 
PCP DESCRIPTORS (e.g., mxn matrices, where n is the 
number of Eigenvectors and m is the length of the 
sequence) 



106 



MATCHING VALUES OF THE PCP MOTIF PROFILE 
MATRICES TO WINDOWS OF THE SEQUENCES IN THE 
CONVERTED SEQUENCE DATABASE USING A POSITIONAL 
SCORING FUNCTION 



,108 



SELECT ONE OR MORE SCORED WINDOWS AS BEING 
BEST MATCHES FOR THE PCP MOTIF PROFILE MATRICES 
USED TO DEFINE EACH PCP MOTIF 



PROVIDE FILE INCLUDING VALUES AND/OR SEQUENCES 
OF THE SEGMENTS OF SEARCHED SEQUENCES THAT ARE 
BEST MATCHED TO THE PCP MOTIF PROFILE MATRICES 
USED TO DEFINE EACH PCP MOTIF (e.g., matched areas of 
proteins PCP characteristics to the family used to generate the 
multiple sequence alignment but which have different amino acid 
sequences) 



36 



110 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicant^): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mail No.: EV 201890706 US REPLACEMENT SHEET Sheet 7 of 17 



Figure 7 



PROVIDE FILE INCLUDING VALUES AND/OR 
SEQUENCES OF THE SEGMENTS OF SEARCHED 

SEQUENCES THAT ARE BEST MATCHED TO 
THE PCP MOTIF PROFILE MATRICES USED TO 
DEFINE THE PCP MOTIFS 



.120 



LINK DATA FOR EACH FOUND SEGMENT FOR 
EACH PROTEIN IN THE SEQUENCE DATABASE 




DETERMINE AN OVERALL SIMILARITY 
DISTANCE SCORE FOR EACH PROTEIN OF THE 
SEQUENCE DATABASE BEING SEARCHED 
BASED ON THE SCORE FOR EACH FOUND 
SEGMENT AND RELATIVE TO WHAT A 
RANDOM SCORE FOR THE FOUND SEGMENT 
WOULD BE 



> 


f 


RANK THE PROTEI1 
DATABASE BEING SE; 
OVERALL PCP SIMILE 


SfS OF THE SEQUENCE 
\RCHED ACCORDING TO 
VRITY DISTANCE SCORE 


> 


f 


PROVIDE OUTPUT OF PROTEINS WITH THE 
HIGHEST SCORE 



124 




126 



128 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mall No.: EV 201890706 US REPLACEMENT SHEET Sheet 8 of 17 



Figure 8 



PROVIDE STRUCTURE FOR A 
PLURALITY OF THE SEGMENTS 
THAT ARE BEST MATCHED TO THE 
PCP MOTIF PROFILE MATRICES USED 
TO DEFINE THE PCP MOTIFS 



142 



CALCULATE THE SEGMENTAL RMSD 
BETWEEN THE QUERY STRUCTURES 
AND THE STRUCTURES OF THE BEST 
MATCHED SEGMENTS 



146 



RANK THE PROTEINS OF THE 
SEARCHED SEQUENCE DATABASE 
ACCORDING TO THE STRUCTURAL 
SIMILARJTY 



150 



RANK THE PROTEINS OF SEARCHED 
SEQUENCE DATABASE ACCORDING 
TO THE STRUCTURAL SIMILARITY 
RANKING AND PCP SIMILARITY 
DISTANCE SCORING 



152 



40 



PROVIDE QUERY STRUCTURES RELATED 
TO THE SEGMENTS OF THE MULTIPLE 
SEQUENCE ALIGNMENT (e.g., one or more 
segments of the sequences aligned, the consensus 
sequence of the alignment, etc.) 



148 



PROVIDE SCORING (e.g., ranking) OF 
PROTEINS OF SEARCHED 
SEQUENCE DATABASE 
ACCORDING TO PCP SIMILARITY 
DISTANCE SCORING 



154 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mall No.: EV 201890706 US REPLACEMENT SHEET Sheet 9 of 17 



> 

o ^ 

^ -s 

GO 



2 



< * 



£ T3 
O 

O -O 

GO ■ ' 

w 

Q- - 
< 



00 



CO 

CO 



oo 



*o r- m »o oo 

o o o o — o o 



— no on 
o — o o o 



oooooooooooo 



+1 +1 -4-1 4-1 +1 +1 4-1 4-1 4-1 

no — -d-r-oor-ioo^ 



- ^ fN 
vo «r~> vr> 

OOOOOOOOOOOO 



oooooooooooo 
oooooooooooo 



4-1 4-1 



4-1 4-1 



4-1 4-1 4-1 4-1 4-1 4-1 



oooNoooooNr-»ONOoONOOONON 



oooooooooooo 



r^ovo — v©t--oomcoo^j-m 
r^-r-vooooor^t^r^oor-NOoo 

oo'oooooooooo 



m^j-o — oo oo rs) oo o — -vooo 
vooovor-vor— oor^r-vo*/->oo 

ooo'ooo'oooooo 



* * 

o m ^ 

OS On 

o o 



o 



NO 



° o 



****** 

oo ~ m on 
oo on on r-«- oo on 
o o o o o o 



— NO — Tj"VOOvovovOvO(NrO 

ON ON OS ON ON OO ON ON ON OO ON ON 
OOOOOOOOOOOO 



< 



o 

S w 

3 i 



o 



ON 




- rs n >n 



^j- no 
no r- o 
(N (N n 



^ ^ O — 
oo On — — f— 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mail No.: EV 201890706 US REPLACEMENT SHEET Sheet 10 of 17 



a 



cu 

o 



o 

2 



2 

IS 



CQ 
Q 



CO 



o 1 £ 
g & 

* c 

-a - 5 

O g 
-C CL 
CL c+_ 
— — O 

<D 2 c 

C3 w OS 

= ^ g E 

— o .s o 

-2 ^ -S o 
< W Q Q. tt! 



(30 g 

S § 
« .2 



u 
o 



.12 

o E— 

u a 



oo fN 

— <N m i 

On »— o o o on ^ m o • — ■ 

<N — | O O O — ; Tlj O ^ 



~ fN fN 

^ — ; rn — ; fN 
<N — fN — • 



— — — — oo _ 
_ , — — m - 



oC 

rs ^ fN — i 

~ ^ ^ ~ CN <N fN fN 

t~-„ ^ ON ^ — — ^ — 

vo r-" oC K on vcT 

fN ~ *o vo vo" fN >o 



o 




VO 
















o 


Ov 




10 


ro 








oo 


Co 




o 


o 


o 










m 


m 












o 




6 


o 


o 


fN 






vo 










v — / 






VO 


On 


to 


o 




r- 


fN 


r-» 


oo 


ON 


oo 


o 


o 






ON 


On 


m 


On 










oo 


oo 


r- 


r- 


r-> 


vo 



r- O -> . 

x, < a 2 



3 ^ >* 
^ 9 & * 



O On 

w > <y 



o 
a 



CL C/3 LU 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mail No.: EV 201890706 US REPLACEMENT SHEET Sheet 11 of 17 



o 

On 

I 

&0 



0> 

Q 



O 

u 

c/3 



=3 

O 

a 
o 

2 



2 

o 

JD *3 to 
^ — <U 
C g JCJ 

.2 tsi: 

<U C! .1-h 
O CO fll 

a) ^-5 



on 

3 



CO O 

o *C 
3 >^ 
c x 
o o 

X w 



8 ^ * 

I «s 

O <t> 

.5 co 

J2 a. 

a. g 

■S J 

CL < 



CO 

c N 

N w 

n — ' <y 

Co <v> 

II 

CO O 

o O 

§ £ 

> CO 



o 

CL 

J 

c2 



2 I 

C§ CL 
CO < 



1 J 

"si 

P 

*C0 « 

£ .S 
^ S3 



oo 

^ CM — 



os — — — 



fN 



On — CnJ — <N co 

rsj — | — ; 

ro ro CO <N 



_* *n — oj — i cn} — '• 

' — ; ^ >n ti n ■ — ' ^ 
"d 13 TD 13 O *0 T3 cj <S cd 



ON On 2 2 

K ^ 0~ o 
vo v© ~- 

*/~T */~f aC on — oo on oo 
r{ cn so no oC vo irT \o 
u-T «n «r? cN »o 



On — — 







































/— s 




o 


On 


in 






m 






O 


o^ 




O 


O 






co 


CO 


m 


CO 












o 


o, 


O 


o 




o 


cn 




CM 


















m 


r- 




oo 


ro 


Ov 




On 




On 


oo 


o 




On 


m 






r- 










On 


NO 


NO 


NO 


NO 




«n 



8 
I 

t-H 

r| 



Q 



c 

'£> 
o 

CL 



T3 
O 

o 
CQ 

a 

CL, 



C 

c 

o 

CO 

o 

J 

o 

c 

o 



o & 

CJ c 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME ^ 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mail No.: EV 201890706 US REPLACEMENT SHEET Sheet 12 of 17 



REST AVA" *-"<=cOPY 



N atural f raquency of Ami no ac id Natural frequency of Amino acid 




Natural frequency of Amino aci d Natural frequency of Amino a rid 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817.530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mail No.: EV 201890706 US REPLACEMENT SHEET Sheet 13 of 17 



Figure 1 1 



SEQ ID NO: 1 



El 

E2 
E3 
E4 
E5 



D 



+ 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mall No.: EV 201890706 US REPLACEMENT SHEET Sheet 14 of 17 



Figure 12A 

0.O -| 

Qfl ■ 

8 °" 

0.6 • 
% OS- 




0 0.1 02 0 3 0.4 05 0* 0.T OA OP 1 



Score range (motif #1) 



Figure 12B 




Score range (motif #2) 



Figure 12C 



Figure 12D 




Score range (motif #3) 



Score range (rnotif #4) 



Figure 12E Figure 12F 




0 0.1 02 0.3 0.4 OS 0.& 0.7 Q& O0 1 D 0.1 02 03 04 OS 06 0.7 OS O.D 1 



Score range (motif #5) Score range (motif #6) 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicant(s): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mall No.: EV 201890706 US REPLACEMENT SHEET Sheet 15 of 17 



Figure 12G 



Figure 12H 




at 02 o.3 a.4 0.9 aa ot oa as 
Score range (motif #7 ) 



01 02 0.3 04 0.5 00 OT 08 OS 

Score range (motif #8) 



Figure 121 




0 01 02 0.3 04 Q.S 00 O.T QB 00 I 



Score range (motif #9) 



Figure 12J 





00 -I 






oa ■ 
o? • 




8 


06 - 
Hi - 




Fraction of 


04 . 
03 - 
02 ■ 
0l- 
Q ■ 





o oi 02 0.3 04 0,5 aa a? oa aa t 

Score range (motif #10) 



Figure 12K Figure 12L 




0 01 02 0.3 04 0.S OA 0T 00 00 I 0 Q l 02 0.3 04 QjJ 06 O? 00 00- I 



Score range (mottft 1 1 ) Score range (motif # 1 2) 



Title: PHYSICAL-CHEMICAL PROPERTY BASED SEQUENCE MOTIFS AND METHODS REGARDING SAME 
Applicants): Braun et al. 

Serial No.: 10/817,530 Filed: April 2, 2004 Docket: 265.00400101 

Express Mail No.: EV 201890706 US REPLACEMENT SHEET Sheet 16 of 17 



Figure 13 
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Seq ID No: 2 lbix LYEDPPDQKTSPSGKPATtKICSWNVDGLRAlw IKKKGLDWVKE . EA 

Seq ID No: 3 lako MKFVSFNINGLRAR P. .HQLEAIVEkHQ 

Seq ID No: 4 3dni LKIAAFNIRTFGetkmsnatLASYIVRIVR . . . R 

Seq ID No: 5 li9yA ydpiheyvnhelrkrenef seHKNVK iEFVASYNLNG . . Q S ATTK . LENWLF . Pe 



2 

lbix I P DILCLOETM . . . . CSENKL.P. . AELQEL PGLSHQYWS . APSD 

lako P DVIGLQETK . . . . VHDDMF . P . 1EEVAKL G. . YNVFYH . G - . . 

3dni Y DIVLIQEVP . . . . DSHLVAvG . . KLLDYL nqddpNTYH . YVVSePLGR 

li9yA IntpladiYVVGFQEIM al tsADPAkr reweSCVKRLlnakctsapaYVQLRSGOL . V . . . 



3 4 

lbix KEGYSGVGLLSR . . .QCP LKVSY GIGDE . . . . EHDQE . GRVIVAEFD 

lako QKGHYGVALLTK . . . ETP IAVRRGFPGD DEEAqRRI I MAE I P . . .Si 

3dni nSYKERYLFLFRpnkVSV LDTYQ . YDDGccgnDSFSR . . EPAVVKFSshst k 

li9yA GTALMIF. . . Ckesclpsi knVEGTVKK tGLGN . KGAVAIRFD . . .ye 



5 6 

lbix . . SFVL VTA YVPNA GRGLV . . RLEYRORW DEAFRKFLKG LA . . . S . RK . IPLVI 

lako IgNVTVINGYFPQGESRDHpiKFPAKAQFYQNLQNYLET EL . . . KrDN . PVLI 

3dni vkEFAIVALHSAPS DAVAEINSLYDVYLD VQqkwH . LN . DVML 

li9yA dtGLCFITSHLAAGY TNYDERDHDYRTIASglrf rrgrSI . . . F.NHd lYVVW 



lbix 
lako 
3dni 
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EXO_ M.thermautotrophicus 9 
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EXG_Synechocyeti8 4.6.8 
EXO_M. tuberculosis 4.6.8 
EXO_JUulgidus 3.6.6 

EXO_C.crescentuB 8,9 
EXO.M.Ion 6.B.9 
EXO_Atumefaciens 6,8 
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EXO H. influenzae 4,6 
EXO_E.coli 4.6 
EXO_S.enterica 4.6 
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EXO_V.cholerae4.6 
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